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Abstract — Wideband communications are impossible with sig- 
nals that are spread over a very large band and are transmitted 
over multipath channels unknown ahead of time. This work 
exploits the I-mmse connection to bound the achievable data- 
rate of spreading signals in wideband settings, and to conclude 
that the achievable data-rate diminishes as the bandwidth in- 
creases due to channel uncertainty. The result applies to all 
spreading modulations, i.e. signals that are evenly spread over 
the bandwidth available to the communication system, with SNR 
smaller than log(W/L)/(W/L) and holds for communications over 
channels where the number of paths L is unbounded by sub- 
linear in the bandwidth W. 

I. Introduction 

This work analyzes the performance of wideband communi- 
cation systems, we characterize the SNR regime that does not 
allow communications in the wideband limit. The underlying 
reason for the inability to communicate is channel uncertainty, 
and our result applies to channels where the number of 
multipath components is unbounded and sub-linear in the 
bandwidth. Our proof is based on the I-mmse connection [3] 
and hinges on a calculation of the minimum mean square error 
(mmse) estimate of the unknown channel. 

We consider signals that spread their power over the entire 
bandwidth, such as PPM or impulse radio [14], where the 
pulse shape and duration determine the bandwidth, or direct 
sequence spread spectrum, where the chip duration determines 
the frequency spread of the signal. Our result applies also 
to OFDM-type signals, if the entire available bandwidth is 
used concurrently. Examples of signals that are not spread over 
bandwidth are FSK and multi-tone FSK [6], [5], where each 
symbol concentrates power on a small span of frequencies, 
although the entire range of symbols may span a very large 
bandwidth. 

Our result shows that channel uncertainty is detrimental to 
spreading systems operating over multipath channels where 
the number of apparent paths is unbounded but sub-linear 
with the system bandwidth [11], [10], essentially because 
of the signal uses too many eigen-modes of the channel. 
Our model of channel variation in time is a simplistic block 
coherent one, where the channel is fixed for known lengths 
of time (coherence periods) and realizations over different 
coherence periods are IID. This channel model offers the 
advantage of eigen-modes that are particularly simple, as 
harmonic signals are eigen-modes of any linear time invariant 
channel. The channel uncertainty a communication system 
faces when operating over the block-coherent channel is thus 
limited to the eigen-values of the channel, or in other words to 



the complex channel gain over the frequency band the system 
uses. 

The essential feature of spreading signals that renders them 
ineffective over wide bands, is that they use the entire range 
of channel eigen-modes concurrently, and are thus exposed to 
uncertainty of a large number of parameters (channel eigen- 
values). Modulation schemes of the FSK type, that exploit a 
small number of channel eigen-modes per symbol, are exposed 
to uncertainty in only a small number of parameters. 

Our result can be extended to more complex channels, 
where the variation in time is described using the Doppler 
spectrum rather than by block-coherence [9], [12], [2]. The 
eigen-modes of such channels are approximately given by 
orthogonal Weyl-Heisenberg bases [4] in the under-spread 
case, i.e. when the channel's response is highly concentrated in 
the delay-Doppler plain. The essential feature that determines 
whether communications are possible in the wideband limit 
is the spreading of symbol power over the unknown channel 
eigen-modes. In channels where the eigen-modes are not 
known in advance, it is very difficult to overcome channel 
uncertainty. 

Related work has shown that pulse-position modulation 
(PPM) systems are unable to handle uncertainty in the delays 
of multipath components [7], [8]. We showed that the receiver 
is unable to detect the channel paths if the bandwidth is large 
enough, whether it uses a threshold detector [7] or a maximum 
likelihood detector [8]. This work extends the scope of past 
results to a wider family of signals, and makes a statement on 
the achievable rate. 

The comparison of our result to that of Telatar & Tse [13] is 
also interesting. The T&T results discussed continuous signals, 
i.e. effectively using a signal to noise ratio (SNR) that inversely 
depends on the system bandwidth SNR = 9 (^), whereas 
our result is more general in the sense that it specifies the 
range of SNR dependencies on bandwidth that does not allow 
communications in the wideband regime. In other words, our 
result applies to impulsive systems and implicitly indicates the 
minimal level of impulsiveness that allows communications in 
the limit of large bandwidth. 

Another point of comparison of our result to that of [13] 
is the type of signals to which they apply. T&T consider two 
types of spreading signals: IID complex signals and signals 
with a very low cross-correlation: ^\ XiXi- n ~ const that 
does not depend on the length of the vector. Our result applies 
to a wider family of signals, where the cross correlation may 



as high as 6 I %/ W ) . This is significant because spreading 
signals generated by IID or pseudo-random sequences have 
an empirical correlation that varies as 9 I \/W J . 

The results in [8], that connect the number of channel 
paths to the level of impulsiveness are also relevant here. The 
impulsiveness parameter there is easily translated to the SNR 
per active burst of transmission, and the results in [8] basically 
determine a lower bound on the SNR, above which the channel 
uncertainty penalty is insignificant in the wideband limit. Our 
new result involves an upper bound on the SNR, below which 
the channel uncertainty penalty prevents communications. 

II. Model 

We consider communication systems with a (single sided) 
bandwidth W, operating over block-coherent multipath chan- 
nels. We use a real discrete model of the system, after sampling 
at the receiver at rate W . The model over a single coherence 
period is given by 



Y = VSNR X-kH + Z 



(1) 



where Y is the received signal over an entire coherence period 
of length T c , this is a vector with K c — T C W entries. The 
vector X of length K c and average energy K c represents the 
transmitted signal, the multipath channel is represented by the 
vector H of length K c and * marks a convolution. Z is white 
standard Gaussian noise (IID with zero mean and variance 
one) and SNR is the signal to noise ratio. The SNR can be 
understood as the signal to noise ratio per frequency resolution 
bin or per degree of freedom. We neglect in (Q]) the edge effects 
at the beginning of the coherence period. 
We impose a probabilistic energy constraint: 



P(\\X\\ 2 >(l + o(l))K c 



w- 







(2) 



Note that IID signaling satisfies this assumption. 

The transmitted signal may be impulsive, we consider the 
signal to noise ratio during active transmission so there is no 
need to explicitly address the impulsiveness used by the system 
(i.e. the duty cycle ratio). We assume that the transmitter 
does not use information on the channel realization, and the 
transmitted signal does not depend on it. 

The transmitted signal X is wideband: its empirical auto- 
correlation is upper bounded by 



X\Xi 



< B 4 \/K, 



i^j 



j = l,...,K c (3) 



with B4 is a constant that does not depend on the bandwidth. 
The notation < , > is used for the inner product of vectors, 
and the notation X % is used for a vector X that is cyclicly 
shifted by i positions, i.e. 



X 1 = 



X 
X 



(2-*) 



\ 



(4) 



V X (K e -i) / 

where ( — ) indicates a mod K c difference. 



The channel is composed of L paths, each with a delay 
in the range [0, T^W] where Td is the delay spread. The 
channel is block-constant with coherence time T c , i.e. it has 
IID realizations over different coherence periods. We assume 
Td <C T c and thus justify to an extent our loose treatment of 
edge effects at the beginning of each coherence period. We 
approximate ([TJ with a circularly-shifted matrix: 



Y = VSNR xH + Z 



(5) 



where 



( ^ 

X 2 



X Kc 

Xi 



Xo 



Xk c 



\ x Kc ■ 

= ( X° X 1 



x 2 X x j 



X 



Ka-1 



(6) 



The channel model is real, L channel gains are IID and 
zero mean, with variance 1/L, so the energy in the channel's 
impulse response equals one on average. We assume an upper 
bound on path gains \Hi\ > Ml, with a constant B\ that does 
not depend on the bandwidth. The choice of the L non zero 
taps is uniform over the ( L c ) possibilities. 

The number of paths L diverges as the bandwidth increases 
in a sub-linear manner [11], [10], i.e. L 

L/W > 0. 
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We make a probabilistic assumption on the channel's re- 



sponse 



P 



Kr 



E 



H, 



X\Xi\ 



w- 



■ B, y K, I ^0 1 = 1,2, 

(7) 

with a constant B3 that does not depend on the bandwidth. 
The typical value of the correlation in (Q is \/K c , so this 
assupmtion is a natural one. By taking a large constant B3 we 
ensure that our result holds for almost all values of i. 

III. Result 

Theorem 1: Communication systems modeled by ([5j, that 
use spreading signals and operate over multipath channels 
as described in Section [II] with SNR <C °%jii j have a 
diminishing rate in the limit of large bandwidth: 

I(Y;x) 



i^ c SNR w^c 







with probability 1. 

We prove the theorem in Section [V] by showing that 



lim 
w^oo i/OSNR 



(Y;H\x) 



and applying 



I(Y;x) = i(Y;H,x)-I(Y;H\x\ 



< -K C SNR - I ( Y; H\x 



(8) 
(9) 



,K r 



I(Y;H,x) upper bound 




where mmse (SNR ) is given by 



log(W/L)/(W/L) 



Fig. 1, A sketch of the I-SNR relationship for spread signals for a very 
large bandwidth. The coherent datarate upper bound (top graph) is linear in the 
low SNR regime, and convex. The channel uncertainty penalty in the bottom 
graph is linear for low SNR values, and saturates at the channel's entropy. 
The incoherent datarate (middle graph) is not convex. 



The term I (Y; H\x) in (0 is the datarate penalty due to 
channel uncertainty. 

IV. Discussion 

The proof of Theorem Q] is based on calculating the mmse 
estimate of the channel response H, given the transmitted and 
the received signals. We show that this mmse estimate is a 
vector with an o(l) norm in low SNR conditions, essentially 
because the noise Z overwhelms the information carrying 
signal. 

Theorem Q] shows that in the wideband limit, the low SNR 
regime can be divided in to parts: 

SNR « ***£. (10) 

K c /L 

where spreading signals are not effective, and 

SNR>^|^ (11) 

K c /L 

where although the SNR diminishes in the limit, it enables a 
positive datarate. 

The channel uncertainty penalty is upper bounded by the 
channel entropy, and the bottom graph of Figure Q] thus 

saturates at SNR = ° E K f£ \- j[, where the first part 

corresponds to he entropy of the paths' delays and the second 
to their gains. 



V. Proof of Theorem Q] 

The proof is based on the I-mmse connection, in particular 
Theorem 2 of [3], This theorem gives a simple formula to 
the achievable rate of communications over a known vector 
channel in terms of the error of the mmse estimate of the 
transmitted signal. We reverse the roles of H and x in our 
usage of Theorem 2 of [3], i.e. consider x as known and H as 
the estimated party. Using our notation, the I-mmse connection 
says that as long as the vector H satisfies E||iJ|| < oo we 
have 

/ , N i r SNR 

I \H; VSNR xH + Z) = - mmse (SNR ) dSNR 

(12) 



mmse (SNR ) = E 



xH - xH (Y; SNR ) 



(13) 



and H is the mmse estimate of H given both x and Y. We will 
show that the mmse estimate is a vector with an o(l) norm in 
low SNR conditions, thus the minimal mean square error (fT3l 
converges to K c , the mutual information (fT2l converges to 
iX c SNR and © diminishes. 
The mmse estimate is given by 



H = E[H\Y,x] 



(14) 



We lower bound the mmse by calculating the minimum mean 
square error in a system that is given additional information 
on H, namely which of its positions satisfies 0. 



H' = E H\Yx,I, 



(7) 



(15) 



where Ifj\ is a list of indices {i} where Hi satisfies 01. The 
additional information can only reduce the mmse. 



H' = 



f f ... f Hf(H\Y,x)dH 1 dH 2 ...dH L (16) 

The conditional probability density in ( TToT ) is manipulated 
using the independence of the transmitted signal from the 
channel. 

~, = ff...fHf<y\x,H)f(H)dH 1 dH 2 ...dH L 

Jf...ff(Y\x,H)f(H)dH 1 dH 2 ...dH L ( 

The conditional probability density in ( TPTT i is Gaussian as 
Y\x,H — J\f(xH,I). We denote by f s ( ) the probability 
density of a K c long vector of IID standard Gaussian variables 



/s(5)= (^^ exp H 115112 



(18) 



and proceed to examine the components of the vector H'. 
Consider first positions (indices) j where Hj = 0. At these 
positions, any non-zero value of H'- increases the estimation 
error and can be disregarded in the calculation of a lower 
bound on the mmse. Let us examine the positions where Hi ^ 
and (0 holds, and look at the estimates of each such value: 



H[ =//.../ Hif s (Vsmx^H-Hj+Z 

f (H) dffi dH 2 ... dH L I 



J J... J f s (VSNRx(H-H 
f (H) dHi dH 2 ... dH L 



Z 



(19) 



We prove that the mmse estimate ( fl9l l is very small by 
comparing the integral in the nominator to the integral in 
the denominator, that effectively sums over a bigger group of 
assignments of H. Assuming (0 we show that the nominator 
of dT9l is negligible when compared to its denominator. 

We first approximate both integrals in (fl9b by sums over 
sampled groups of values of each positions in the vectors H. 



The sampling is done over a tight enough grid that the resulting 
errors are small. 

An upper bound to $1% is calculated by breaking the sum 
in the denominator to a series of sums over groups of values 
of H, where each group corresponds to a single assignment of 
H in the nominator. The set B consists of assignments with a 
non-zero value H t ^ 0. Rewriting the discrete approximation 
of ( fT~9b we get 



m 



Eheb h Js (Vsnr x (h-h)+z)p (H) 



E 



GeB+B" 



fs[VSNRx(H-G) +Z)p(G) 



(20) 



the notation G was introduced to improve clarity we define 
p(H) = f(H)AH L and AH is the sampling step of each 
dimension of the vector H. 

We proceed to divide the entire range of vectors G into non- 
overlapping subgroups, such that for each H £ B we have a 
corresponding subgroup A{H), such that 



J(H) 



HifAxlH-H) +Z)p{H) 



E 



GeA(H) ■ 



f a lx(H-G +Z p(G) 



(21) 



diminishes in the limit of large bandwidth. The convergence 
of < f20b to zero follows directly, as the nominator of ( l20l > is a 
sum of the nominators of J(H ) for all H £ B. 

The subgroups A(H) are created randomly. For each as- 
signment of G that has a non-zero value in the i th position, 
it is put in the subgroup A(G). For a vector G with G t = 
we (uniformly) choose one of its non-zero taps and replace it 
to the i th position. To clarify the process, let us say that the 
j th position of the vector G was chosen. We calculate a new 
vector H by 



Hi = Gj ; ^ = ; H k = G k for k £ i, j 



(22) 



and assign the vector G to the subgroup A(H). 

Each group A(H) contains H and about (K c — L) / L other 
members, each different from H in exactly two positions. We 
ensure that groups' sizes do not deviate significantly from 
(K c — L) / L by relocating memebers from large groups into 
suitable smaller ones. 

We denote by H l ^ k a member of A(H) that differs from 



H by exchanging the values in its i and k positions, and 
define H 1 ^ 1 = H. The set JC(H) holds the values of k such 
that H l ^ k e A(H). 

The terms p (H) in the nominator of (|2TT i and p (G) in the 
denominator are identical for all members of the group A(H) 
because of our assumptions on IID gains and a uniform spread 
of the path delays. 



Hi exp 



J(H) 



Y - VSNRxH 1 



Z^keK(H) ex P 



Y - VSNR xH l 



(23) 

The denominator of d23l contains a sum over about K c /L 
exponents with different values of k, including k = i and 



the nominator holds a signal such factor with k — i. We 
take a close look at their exponent and introduce the notation 
I(Hi,k) for a iT c -long vector with the value Hi at the k th 
positions and zeros elsewhere. 



Y - VSNR xH l 

\Y\\ 2 



VSNR x (H l ^ k - I (Hi,k)) 



VSNRxI(H t ,k) 
/VSNR x (H l ^ k - I (Hi, k)) 
VSNRxI(Hi,k)\ 

k 



(24) 
(25) 
(26) 



(27) 

VSNR xH , VSNR x (W^ k - 1 (H h k))\ (28) 

( VSNR xH, VSNR xl {H u k)\ (29) 

k -I{H ll k))) (30) 

(31) 



Z, VSNR x (H 
Z,VSNRxI{Hi,k) 



we now deal with each line (l24l-([3T|i separately, to show that 
the nominator of ( f23l is much smaller than the denominator. 

{[24[ : The term — | ||Y|| does not depend on k. 

f |25l >, ( [251 ), t30t : These terms do not depend on k because 
the vectors H % ^ k — I (Hi, k) are identical over k e JC(H). 



The term 



VSNR xI(Hi,k) depends on 
k, but the norm is constant over k £ K(H). The vector 
/ (Hi, k) essentially extracts a single column of the matrix x 
and multiplies it by H b . The matrix x is circularly symmetric 
and thus ( f26b is fixed. 
((27l): The term 



a k 



= - (\/SNR x (H l ^ k - I (Hi,k)), 



VSNR xl (H h k) 
-SNR Hi Y^ H J 

3 = 1 J"#fe 



xi- 1 , X 



fe-1 



is significantly smaller than J29l ) at the nominator of 

in other words an order of magnitude smaller than K ^ R . We 

prove this by calculating 



m = -SNR H 



E 



HiiXi- 1 ^ 1 



(32) 



rrr. 



The typical value is on the order of Si SNR ^=& and a loose 

upper bound is given by Using condition (|7]i on H, that 
basically ensures a low correlation in d32l) . we have 



|oi| < BxBsSNR 



(33) 



i29\ : The term 

b k = /VSNR xH, VSNR xl (H h k) 

K c 



SNR H % Y^ Hj 



Xi~ l ,X 



k-l 



is the dominant term in the nominator of 



A",. 



i.e. for k = 



b t = SNRH t Y^ Hj (Xi- x ,X 



(34) 



The dominant term in the sum is the 1 

< (1 + o(l)) K c with high probability 

K r 



' '' ' where 



I'- 1 ,!' 



\h\ < B{ (l + o(l)) SNR — - + B1B3SNR 

Li 

= B\ (l+o(l)) SNR ^ 



VI 



the last approximate equality is tight in the limit of large 
bandwidth. 

(ED- The term c k = /z, VSNR xl (H u k)\ is the 
dominant term in the denominator of (1231 . The sum of 
exponents of {c k } is lower bounded by a single exponent with 
k* E JC(H). We use asymptotic order statistics to show that 
there is k* e K{H) such that c k * = J^f^. J 2 log ^ in 
the limit. To prove the existance of k* we examine the joint 
probability density of {c k }. These are mutually Gaussian zero 
mean random variables, with variance 

var (cfc) = ff 2 SNR (j& ,X*\ < HfSNR (1 + o(l)) K c 

and covariance 

cov (c k , c m ) = #?SNR (x*,X™\ < iJfSNRB 4 V^ 



/ 

We collect {c fc } into the vector C of length M = \K.(H)\ and 
mark its corrlation matrix by R c . R c is positive definite, it has 
a constant and large value on its diagonal, and significantly 
smaller values off-diagonal. 

The mean and variance of the maximal of M IID ~ 
N (0, o" 2 ) random variables are given by [1]: the mean equals 

lnlnM + m27r-2C / 1 



a V2 In M 



and the variance is 



2\/21nAf 



O 



O 



InM 



12 In M 



ln 2 M 



(36) 



(37) 



where C ~ 0.5772 is Euler's constant. Note that for a large 
M the variance diminishes. These results cannot be directly 
applied to the maximal {c k } because these variables are 
correleated. We show that the correlations among {c k } are 
insignificant in the limit of large bandwidth in the sense that 
there is a c k * that is very similar to the maximal of IID 
Guassians, and coclude that in the limit 



Cfc* 



(1 + o(l)) J Hf SNR K c \/2lnM (38) 



In the nominator of d23l ) the term a is insignifi cant, it has 
zero mean and a small variance on the order of \/ Kc ^ fR . 

To summarize the discussion of d24"ii-(l3"Tl). we can upper 
bound d23l in the limit of large bandwidth using the significant 
terms in the nominator and denominator: 

Hi exp ■ 



J(H) < 



Y-VSNRxH^ 



max fee/c(ff) exp 
w Hi exp {a,i + b % - c fc * } 

<r rr Jo R 2^SNR 
< Hi exp < 3B( ■ 



Y - VSNR xH^ k 



(39) 



L 



K C SNR 



2 log- 



AV 



(40) 



and for SNR 



i°g¥ 



the exponent of J401 diverges 



(35) to — 00 as the bandwidth increases, and J(H) 



W- 



Replacing K c by WTq, the proof of Theorem[T]is complete. 
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